Bandwidth mismatch compensation for robust speech recognition

Authors

Yuan-Fu Liao

Jeng-Shien Lin

Wei-Ho Tsai

Abstract

In this paper, an iterative bandwidth mismatch compensation (BMC) algorithm is proposed to alleviate the need for multiple pre-trained models when recognizing speech of different bandwidths. The BMC borrows the concept of bandwidth extension from speech enhancement approaches. However, it aims to improve recognition accuracy directly rather than speech intelligibility or quality, and it uses only the recognizer's hidden Markov models (HMMs) for both bandwidth mismatch compensation and recognition. The BMC first detects the bandwidth of the input speech signal based on a divergence measurement. An HMM/Gaussian mixture model (GMM)-based method is then used to iteratively segment the input utterance and compensate the speech features. Experiments under severe bandwidth mismatch conditions, i.e., training on an 8 kHz bandwidth database and testing on 4 kHz or 5.5 kHz bandwidth databases, have verified the effectiveness of the proposed approach.
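The bandwidth detection step can be illustrated with a rough sketch. The abstract does not say which divergence or which features the authors use, so everything below is an assumption made for illustration: the utterance is summarized by average energy in eight 1 kHz bands, each candidate bandwidth (4, 5.5 and 8 kHz, taken from the abstract) has a reference profile, and a symmetric Kullback-Leibler divergence picks the closest one. The names `normalize`, `symmetric_kl`, `band_limit`, `detect_bandwidth`, `FULL` and `REFERENCES` are hypothetical, and the energy values are made up.

```python
import math

# Made-up average energy per 1 kHz band for full-band (8 kHz) speech.
FULL = [8.0, 6.0, 4.0, 3.0, 2.0, 1.5, 1.0, 0.5]


def normalize(energies, floor=1e-10):
    """Turn non-negative band energies into a probability distribution.

    A small floor keeps empty bands from producing log(0).
    """
    floored = [max(e, floor) for e in energies]
    total = sum(floored)
    return [e / total for e in floored]


def symmetric_kl(p, q):
    """Symmetric Kullback-Leibler divergence, KL(p||q) + KL(q||p)."""
    return sum((pi - qi) * math.log(pi / qi) for pi, qi in zip(p, q))


def band_limit(profile, cutoff_khz):
    """Zero out the part of each 1 kHz band that lies above the cutoff."""
    limited = []
    for k, energy in enumerate(profile):
        # Band k covers k..k+1 kHz; keep only the fraction below the cutoff.
        fraction = min(max(cutoff_khz - k, 0.0), 1.0)
        limited.append(energy * fraction)
    return limited


# Hypothetical reference profiles, one per candidate bandwidth in kHz.
REFERENCES = {bw: band_limit(FULL, bw) for bw in (4.0, 5.5, 8.0)}


def detect_bandwidth(band_energies, references):
    """Return the candidate bandwidth whose reference profile is closest."""
    p = normalize(band_energies)
    return min(references,
               key=lambda bw: symmetric_kl(p, normalize(references[bw])))


if __name__ == "__main__":
    # An utterance with almost no energy above 4 kHz.
    observed = [7.5, 6.2, 4.1, 2.8, 0.01, 0.0, 0.0, 0.0]
    print(detect_bandwidth(observed, REFERENCES))  # prints 4.0
```

In a real recognizer the reference profiles would presumably come from training data or from the HMM/GMM parameters rather than from a hand-written list, and the detected bandwidth would then select which feature components the compensation step has to restore.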

Similar resources

Multivariate Cepstral Feature Compensation on Band-limited Data for Robust Speech Recognition

This paper describes a new method for compensating bandwidth mismatch for automatic speech recognition using multivariate linear combinations of feature vector components. It is shown that multivariate compensation is superior to methods based on linear compensations of individual features. Performance is evaluated on a real microphone-telephone mismatch condition (this involves noise compensat...

A theoretical bound for noise-robust speech recognition

Model compensation techniques for noise-robust speech recognition approximate the corrupted speech distribution. This work introduces a sampling method that, given speech and noise distributions and a mismatch function, in the limit calculates the corrupted speech likelihood exactly. For this, it transforms the integral in the likelihood expression, and then applies sequential importance resampli...

Asymptotically exact noise-corrupted speech likelihoods

Model compensation techniques for noise-robust speech recognition approximate the corrupted speech distribution. This paper introduces a sampling method that, given speech and noise distributions and a mismatch function, in the limit calculates the corrupted speech likelihood exactly. Though it is too slow to compensate a speech recognition system, it enables a more fine-grained assessment of c...

Robust distant speech recognition based on position dependent CMN using a novel multiple microphone processing technique

In a distant environment, channel distortion may drastically degrade speech recognition performances. In this paper, we propose a robust multiple microphone speech processing approach based on position dependent Cepstral Mean Normalization (CMN). In the training stage, the system measures the transmission characteristics according to the speaker positions from some grid points in the room and e...

An on-line acoustic compensation technique for robust speech recognition

In this work we report on the use of an on-line acoustic compensation technique for robust speech recognition. With this technique acoustic mismatch between training and actual conditions is reduced through acoustic mapping. At recognition stage, observation vectors delivered by the acoustic front-end are mapped into a reference acoustic space, while input data are exploited to update the stati...

Publication date: 2003
